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BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to integrated circuit timing analysis and more 
10 particularly, to abstraction generation for a hierarchical timing analysis. 

Description of the Related Art 

Known very large scale integrated (VLSI) circuit designs, and in particular 
microprocessors designs, often include millions of logic gates which represent 
upwards of 100 million transistors. An important step in the design process of a 

1 5 microprocessor is the performance of static timing analysis, which measures the 
frequency of the design. The static timing analysis enables designers to identify 
critical paths that may require additional design work to meet performance objectives. 
Performing static timing on such large designs at the detailed gate level often results 
in very long run times, significantly slowing the design process and thus potentially 

20 leading to reduced time to market and loss of revenue. 

To improve the runtime of static timing analysis, the timing behavior of a 
given block in the integrated circuit design hierarchy may be represented by a model 
generated from the lower level gates. For example, Figure 1, labeled Prior Art, shows 
a hierarchical diagram of model of an exemplative integrated circuit chip. The model 
25 of the exemplative integrated circuit chip includes a data path 1 10, a control block 
120, a custom block 130, and memory 140. In this model, a custom block 150 and a 
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data path block 160 are abstracted. Providing an abstract timing model in the context 
of a full chip timing run enables more rapid analysis than if the detailed representation 
was used, as well as providing a reduced memory footprint. This approach may also 
be useful when performing timing optimization on part of a design. 

The timing behavior of a block can be represented by different methods. More 
specifically, the timing behavior of a block can be represented as a delay matrix in 
which the delay and slew from each input to each output is explicitly listed. 
Alternately, the timing behavior of a block can be represented as a delay network in 
which a simplified network is provided; the simplified network has the same timing 
behavior as the original network in the hierarchical timing analysis. 

The process of creating a timing model is referred to as abstraction generation 
or timing model generation. One important issue in abstraction generation is path 
extraction. Path extraction relates to the issue of efficiently enumerating paths from 
the inputs to the outputs of a block, often referred to as the timing graph, the signal 
propagation graph or the connectivity graph, and determining the timing behavior of 
the paths represented by the graph. The issue can be formulated as follows: For any 
two nodes, node A and node B in the design, given an input slew and an input 
switching direction at node A, and given a load capacitance at node B, the timing 
behavior should determine the switching direction at node B, and determine delay 
between node A and node B. 

Different approaches have been proposed to address the model generation 
problem. For example, paths may be explicitly enumerated to determine whether 
there is a path from node A to node B, with the path delay computed during tracing. 
Because the number of paths to trace grows exponentially with the number of nodes 
in the design, this approach can lead to prohibitively long runtimes for very large 
circuits. 

Alternately, a series of circuit timing graph transformations may be performed 
by iteratively removing pins and merging arcs until no further change is possible. 
This process yields a compressed delay analysis graph which encapsulates the original 
circuit's timing behavior. However, this technique suffers from complexity in the 
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reduction algorithm; i.e., the process does not lend itself to modeling graph topologies 
containing time borrowing elements that yield topological loops. 

SUMMARY OF THE INVENTION 

The present invention relates to extracting timing model information using an 
underlying static timing analyzer infrastructure based upon the usage of time domain 
signal propagation while performing static timing analysis. More specifically, the 
present invention relates to providing a clock domain injection per primary input or 
pseudo primary input along with the capability of an underlying static timing engine 
to retrieve the minimum and maximum path delay arcs from primary input to flop, 
primary input to primary output, and flop to primary output. The extracted path delay 
arcs are then employed to represent the delay behavior of the macro under analysis, 
and thus to implicitly generate connectivity graph information, without having to 
perform graph tracing or reduction. 

In one embodiment, the invention relates to a method of using static timing 
analysis to extract implicit connectivity graph information. The method includes 
creating a unique clock waveform, defining a clock domain for the clock waveform, 
injecting the clock domain into a control node, propagating the clock waveform from 
the control node to a transitively adjacent observation node, and retrieving transitively 
adjacent control node information to determine path delay information from the 
control node to the transitively adjacent observation node based upon propagation of 
the clock waveform. 

In another embodiment, the invention relates to a system for using static 
timing analysis to extract implicit connectivity graph information. The system 
includes means for creating a unique clock waveform, means for defining a clock 
domain for the clock waveform, means for injecting the clock domain into a control 
node, means for propagating the clock waveform from the control node to a 
transitively adjacent observation node, and means for retrieving transitively adjacent 
control node information to determine path delay information from the control node to 
the transitively adjacent observation node based upon propagation of the clock 
waveform. 
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In another embodiment, the invention relates to an apparatus for using static 
timing analysis to extract implicit connectivity graph information. The apparatus 
includes a clock waveform module, the clock waveform module creating a unique 
clock waveform, a clock domain module, the clock domain module defining a clock 
domain for the clock waveform, an injecting module, the injecting module injecting 
the clock domain into a control node, a propagating module, the propagating module 
propagating the clock waveform from the control node to a transitively adjacent 
observation node, and a retrieving module, the retrieving module retrieving 
transitively adjacent control node information to determine path delay information 
from the control node to the transitively adjacent observation node based upon 
propagation of the clock waveform. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention may be better understood, and its numerous objects, 
features and advantages made apparent to those skilled in the art by referencing the 
accompanying drawings. The use of the same reference number throughout the 
several figures designates a like or similar element. 

Figure 1, labeled Prior Art, shows a block diagram of a hierarchical model of 
an exemplative integrated circuit chip. 

Figure 2 shows an example of the timing of a pair of exemplative signals. 

Figure 3 shows a block diagram of an integrated circuit topology which 
includes control and observation nodes. 

Figure 4 shows a flow chart showing the operation of a path delay extraction 

system. 

Figure 5 shows flow chart of the operation a path delay module. 
Figure 6 shows a flow chart of the operation a timing constraint module. 
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DETAILED DESCRIPTION 

Referring to Figure 2, a domain in static timing analysis is defined to be a 
timing event derived from a clock waveform specification which includes a reference 
time and a direction within a clock cycle. A given clock defines two domains: one for 
5 each clock pulse edge. Each signal in an integrated circuit design is governed by a 
domain. Signals with different domains are kept distinct, i.e., the signals do not 
interact with one another while propagating through nets and combinational logic. 
Combinational edges propagate domains unmodified. Sequential edges do not. 
Sequential flip-flop edges (clock to out) propagate the domain of the triggering clock 

10 edge. Sequential latch-edges (clock-to-out or data-to-out) propagate the domain of 
the opening edge of the clock pin. This is known as clock to data domain conversion. 
Figure 2 shows the creation of clock domains for the clock PHI1, and its logical 
inverse, PHI 2. Where PHI_1 is the master clock definition with event edges el and 
e2 and PHI 2 is a derived clock definition with event edges e3 and e4. Event edges 

15 el and d3 are positive edge events and event edges e2 and e4 are negative edge 
events. 

From the clock edge event definition, timing events known as waves are 
created to represent the clock and data propagation values of arrival time, transition 
time, and switching direction. The propagation of timing events results in the 
20 construction of an implicit, event based, propagation graph for the extraction of 

timing critical paths for model generation, where point to point paths are derivable 
from the domain based analysis graph. 

To determine whether there is a timing path between two nodes, node A and 
node B, a static timing analysis process is performed on the nodes. More specifically, 

25 a set of control nodes in the static timing graph is defined to be either the primary 
inputs or pseudo primary inputs (a pseudo primary input is an output from an 
immediately previous sequential element). A timing domain event can be initiated 
from a primary input or a pseudo primary input. A given control node is identified as 
node A. The set of observation nodes in the timing graph is defined to be either the 

30 primary outputs of the design under analysis, or the input of a sequential element. A 
given observation node is identified as node B. 
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Sequential elements have a dual function. The sequential elements may 
function as a control node and as an observation node (an input of a sequential 
element may function as an observation node while the output of the sequential 
element may function as a control node. The analysis graph is topologically sorted 
such that nodes are in an ordered fashion yielding an acyclic analysis graph which can 
be transversed from left to right. 

Figure 3 shows the concept of transitively adjacent control and observation 
nodes for an example design. Control nodes 3 10 are represented as opaque circles 
and observation nodes 3 12 are represented as black circles. The lines and arrows 
represent potential paths between the transitively adjacent control nodes 3 10 and 
observation nodes 312 that are uncovered by the abstraction generation process in 
accordance with the present invention. 

More specifically, the path between a control node 3 10 and an observation 
node 312 may be via a direct path as represented by line 320, thus these nodes are 
transitively adjacent. The path between a control node 310 and a transitively adjacent 
observation node 312 may be via combinational logic 322, via a flop 324 or via some 
combination of combinational logic 322 and flops 324. Also, some observation nodes 
may also provide a pseudo control node when passing via a sequential element. For 
example, the observation node at the input of flop 324a provides a pseudo control 
node which is the input to combinational logic 322b. 

Referring to Figure 4, a flow chart showing the operation of a abstraction 
generation system 400 is shown. More specifically, the method starts by identifying 
all observation and control nodes at step 405 and proceeds with creating a unique 
clock i for each control node i at step 410. Next, two domains are defined for each 
clock i at step 420. Domain i+ is defined by rising edge of clock i, and domain i- is 
defined by falling edge of clock i. The static timing engine then propagates timing 
events created with the unique clock domain from each control node i at step 430. 

Next, at step 440, the method determines whether, for each observation node j, 
there is an arrival wave of domain i+ and i-. 
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By using a static timing engine, unique paths are implicitly enumerated from 
each control node to the set of affected observation nodes. Thus, at step 460 the 
method recovers the path delay from each control node to the set of transitively 
adjacent observation nodes such that the path delay is recovered between node i and 
nodej. 

Once the path is extracted for a pair of control node i and observation nodej, 
timing constraints are computed at step 470. The timing constraints may include 
setup constraints or hold constraints. After the timing constraints are computed, the 
execution of the abstraction generation system 400 completes. 

Referring to Figure 5, a flow chart of the operation a path delay module 500 of 
the abstraction generation system 400 in recovering path delays is shown. A plurality 
of different types of path delays may be recovered. The module 500 begins execution 
at step 510 by calculating a maximum delay from each control node to the set of 
adjacent observation nodes such that the path delay between node i and nodej is 
computed as: 

MaxM(iJ) R _ R = a LR _ i+ (J)-a LR _ i+ (i) 

where MaxAi(i, j) R _ R is the maximum delay from node i rising to node j 

rising; 

a LR-i+ (J) is the latest arrival time at nodej rising from domain i+ if it is 
present at nodej; 

(0 is the launch time of domain i+ at node i. 

Next the abstraction generation system determines whether to calculate 
another maximum path delay at step 520. Other types of maximum path delays 
include, e.g., the maximum delay from node i rising to node j falling, the maximum 
delay from node i falling to nodej rising, and the maximum delay from node i falling 
to nodej falling. If other maximum delays are to be calculated, then control returns to 
step 510 and these maximum delays are calculated as the maximum path delay 
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between node i rising to node j rising was calculated. If no other maximum delays are 
to be calculated, then control proceeds to step 530. 

The minimum delay is calculated at step 530. For example, the minimum 
delay from node i to node j can be computed as follows: 

MinM{iJ) R _ R = a ERi+ (j)-a ERi+ (i) 

where MinAi(i, j) R _ R i s the minimum delay from node i rising to node j 

rising; 

a ER-i+ 0) is the earliest arrival time a node j rising from domain i+ if it is 
present at node j; 

^ER i+ (0 is the launch time of domain i+ at node i. 

Next, the abstraction generation system determines whether to calculate other 
minimum delays at step 540. Other types of minimum path delays, e.g., the minimum 
delay from node i rising to node j falling, the minimum delay from node i falling to 
node j rising, and the minimum delay from node i falling to node j falling. If other 
minimum delays are to be calculated, then control returns to step 510 and these 
minimum delays are calculated as the minimum path delay between node i rising to 
node j rising was calculated. 

If no other minimum delays are to be calculated, then the execution of the path 
delay module completes. 

Referring to Figure 6, a flow chart of the operation a timing constraint module 
600 of the abstraction generation system in computing timing constraints is shown. In 
timing model generation, once the path is extracted for a pair of control node i and 
observation node j, timing constraints can be computed. 

The timing constraints computed by the timing constraint module 600 include 
setup constraints and hold constraints. The timing constraints at node i can be 
computed from node j, if there is a path from node i to node j, and there is a pre- 
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defined timing constraint at node j. Node j may have a pre-defined timing constraint, 
such as a setup check or a hold check at an input to a flop, or at an input to a latch. 

Step 610 calculates a setup constraint at node i as follows: 
Setup(i,clk) R _ R = MaxAi(i,j) R _ R + Setup(j,clk) R _ R 

5 where Setup(i, elk) R _ R is the setup time at node i rising relative to clock 

signal elk rising; 

MaxAi(i, j) R _ R is the maximum path delay from node i rising to node j 

rising; 

Setup^j, dk) R _ R is the setup time at node j rising relative to clock signal elk 

10 rising. 

Because there may be multiple paths between node i and node j for which the 
setup time is calculated, the module then checks to determine whether to calculate a 
setup time for another path at step 620. The setup time at node i can be computed 
from multiple paths from node i to transitively adjacent, and the maximum value (i.e., 
1 5 the most stringent setup time) is reported as setup constraint at node i at step 630. 

Step 640 calculates the hold constraint at node i as follows: 
Hold (i, elk) R _ R = Hold (j \clk) R _ R —MinAi(i, j) R _ R 

where Hold(i,clk) R „ R is the hold time at node i rising relative to clock signal 
elk rising; 

20 Hold(j\clk) R _ R is the hold time at node j rising relative to clock signal elk 

rising. 

MinAi{iJ) R _ R is the minimum path delay from node i rising to node j 

rising; 
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Because there may be multiple paths between node i and node j for which the 
hold time is calculated, the module 600 then checks to determine whether to calculate 
a hold time for another path at step 650. The maximum value for the calculated hold 
time (i.e., the most stringent hold time) is reported as the hold constraint at node i at 
step 660. 

After the maximum value of the calculated hold time is reported, then the 
execution of the timing constraint module 600 completes. 

The present invention is well adapted to attain the advantages mentioned as 
well as others inherent therein. While the present invention has been depicted, 
described, and is defined by reference to particular embodiments of the invention, 
such references do not imply a limitation on the invention, and no such limitation is to 
be inferred. The invention is capable of considerable modification, alteration, and 
equivalents in form and function, as will occur to those ordinarily skilled in the 
pertinent arts. The depicted and described embodiments are examples only, and are 
not exhaustive of the scope of the invention. 

Also, for example, the above-discussed embodiments include software 
modules that perform certain tasks. The software modules discussed herein may 
include script, batch, or other executable files. The software modules may be stored 
on a machine-readable or computer-readable storage medium such as a disk drive. 
Storage devices used for storing software modules in accordance with an embodiment 
of the invention may be magnetic floppy disks, hard disks, or optical discs such as 
CD-ROMs or CD-Rs, for example. A storage device used for storing firmware or 
hardware modules in accordance with an embodiment of the invention may also 
include a semiconductor-based memory, which may be permanently, removably or 
remotely coupled to a microprocessor/memory system. Thus, the modules may be 
stored within a computer system memory to configure the computer system to 
perform the functions of the module. Other new and various types of computer- 
readable storage media may be used to store the modules discussed herein. 
Additionally, those skilled in the art will recognize that the separation of functionality 
into modules is for illustrative purposes. Alternative embodiments may merge the 
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functionality of multiple modules into a single module or may impose an alternate 
decomposition of functionality of modules. For example, a software module for 
calling sub-modules may be decomposed so that each sub-module performs its 
function and passes control directly to another sub-module. 

Consequently, the invention is intended to be limited only by the spirit and 
scope of the appended claims, giving full cognizance to equivalents in all respects. 



